Multi-GPU systems and Unified Virtual Memory for scientific applications: The case of the NAS multi-zone parallel benchmarks
Authors
Abstract
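• Multi-GPU and Unified Memory implementation of the Multi-Zone NAS Benchmarks.
• Analysis of the programmability and performance effects of Unified Memory.
• Unified Memory and per-GPU allocation have similar programming efforts.
• The Unified-Memory version outperforms manual allocation by 1.1x to 1.85x.
GPU-based computing systems have become a widely accepted solution for the high-performance-computing (HPC) domain. GPUs have shown highly competitive performance-per-watt ratios and can exploit an astonishing level of parallelism. However, exploiting the peak performance of such devices is a challenge, mainly due to the combination of two essential aspects of multi-GPU execution: memory allocation and work distribution. Memory allocation determines the data mapping to the GPUs, and therefore conditions all work distribution schemes and communication phases in the application. Unified Virtual Memory simplifies the codification of memory allocations, but its effects on performance depend on how data is used by the devices and on how the devices' driver is going to orchestrate the data transfers across the system. In this paper we present a multi-GPU and Unified Virtual Memory (UM) implementation of the NAS Multi-Zone Parallel Benchmarks, which alternate communication and computation phases, offering opportunities to overlap these phases. We analyse the programmability and performance effects of the introduction of UM support. Our experience shows that the programming efforts for introducing UM are similar to those of having a memory allocation per GPU. On an evaluation environment composed of 2 x IBM Power9 8335-GTH and 4 x GPU NVIDIA V100 (Volta), our UM-based parallelization outperforms the manual memory allocation versions by 1.10x to 1.85x. However, these improvements are sensitive to the information forwarded to the devices' driver describing the most convenient location for specific memory regions. We analyse these improvements in terms of the relationship between the computational and communication phases of the applications.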
Similar resources
Measuring the staff technology readiness, the case of a multinational chemical company operating in Iran
No abstract available.
The NAS Parallel Benchmarks
A new set of benchmarks has been developed for the performance evaluation of highly parallel supercomputers. These benchmarks consist of five parallel kernels and three simulated application benchmarks. Together they mimic the computation and data movement characteristics of large scale computational fluid dynamics (CFD) applications. The principal distinguishing feature of these benchmarks is ...
Characterizing Shared-Memory Applications: A Case Study of the NAS Parallel Benchmarks
The objective of this report is to present our characterization of a shared-memory implementation of the NAS Parallel Benchmarks (NPB). This characterization is needed to support the design decisions of future shared-memory multiprocessors. This report presents two sets of characterization data; the first set is the application characteristics that do not change from one hardware configuration to ...
Virtual machine workloads: the case for new benchmarks for NAS
Network Attached Storage (NAS) and Virtual Machines (VMs) are widely used in data centers thanks to their manageability, scalability, and ability to consolidate resources. But the shift from physical to virtual clients drastically changes the I/O workloads seen on NAS servers, due to guest file system encapsulation in virtual disk images and the multiplexing of request streams from different VM...
The survey of the virtual higher education in Iran and the ways of its development and improvement
This study was conducted with the aim of "examining the current state of virtual higher education in Iran and the ways of developing and improving it", using a descriptive-analytical and survey method. A review of the available documents on virtual education showed that the number of students, degree levels, and programs of study offered in e-learning courses was not satisfactory, and that, in terms of quality, the indicators for instructors' educational services and for the Internet network in the virtual learning environment were also unfavorable.
Journal
Journal title: Journal of Parallel and Distributed Computing
Year: 2021
ISSN: 1096-0848, 0743-7315
DOI: https://doi.org/10.1016/j.jpdc.2021.08.001